Lexical knowledge and human disagreement on a WSD task
Abstract
This paper explores factors correlating with lack of inter-annotator agreement on a word sense disambiguation (WSD) task taken from SENSEVAL-2. Twenty-seven subjects were given a series of tasks requiring word sense judgments. Subjects were asked to judge the applicability of word senses to polysemous words used in context. Metrics of lexical ability were evaluated as predictors of agreement between judges. A strong interaction effect was found for lexical ability, in which differences between levels of lexical knowledge predict disagreement. Individual levels of lexical knowledge, however, were not independently predictive of disagreement. The finding runs counter to previous assumptions regarding expert agreement on WSD annotation tasks, which in turn impacts notions of a meaningful “gold standard” for systems evaluation. © 2004 Elsevier Ltd. All rights reserved.
Similar resources
USYD: WSD and Lexical Substitution using the Web1T corpus
This paper describes the University of Sydney’s WSD and Lexical Substitution systems for SemEval-2007. These systems are principally based on evaluating the substitutability of potential synonyms in the context of the target word. Substitutability is measured using Pointwise Mutual Information as obtained from the Web1T corpus. The WSD systems are supervised, while the Lexical Substitution syst...
An investigation and comparison of lexical knowledge of deaf and hearing children
Objectives: The present study compares the lexical knowledge of deaf children in two age groups (9-10 and 10-11 years old) with that of two groups of normal-hearing children of the same ages. Method: This is a causal-comparative study. The achievement of 16 deaf children (ages 9-10 and 10-11 years old) and 16 hearing children (ages 9-10 and 10-11 years old) was examined on...
Pre-emption with or without Pre-task Planning: A Probe into L2 Lexical Diversity
The current study, setting a two-fold goal, attempted to see whether the preemptive focus on form (FonF) under either planned or unplanned conditions could contribute to increasing lexical diversity in written narratives and, second, to find whether there was a trade-off between the lexical diversity and accuracy. To this end, 32 beginner learners were selected following a Quick Oxford Placemen...
HIT-WSD: Using Search Engine for Multilingual Chinese-English Lexical Sample Task
We participated in the Multilingual Chinese-English Lexical Sample Task of SemEval-2007. Our system disambiguates senses of Chinese words and finds the correct English translation by using the web as a WSD knowledge source. Since all the statistical data is obtained from a search engine, the method is considered unsupervised and does not require any sense-tagged corpus.
COLEPL and COLSLM: An Unsupervised WSD Approach to Multilingual Lexical Substitution, Tasks 2 and 3 SemEval 2010
In this paper, we present a word sense disambiguation (WSD) based system for multilingual lexical substitution. Our method depends on having a WSD system for English and an automatic word alignment method. Crucially, the approach relies on having parallel corpora. For Task 2 (Sinha et al., 2009) we apply a supervised WSD system to derive the English word senses. For Task 3 (Lefever & Hoste, 2009...
Journal title:
- Computer Speech & Language
Volume 18
Publication date: 2004